Kevin Reick IBM TO ACHIEVE RELIABILITY GOALS

Authors

  • Douglas C. Bossen
  • Joel M. Tendler
  • Kevin Reick
Abstract

Fault-tolerant computing is a mature art whose techniques have migrated from mainframe computers to other product classes. This migration has involved tradeoffs among failure probabilities, defined availability requirements, performance implications, and product cost. At IBM, we have incorporated fault tolerance in designing Power4 systems, which are servers composed of several Power4 chips. The Power4 is an integrated system on a chip (SoC) designed for systems that initially target enterprise Unix customers and, at a later date, OS/400 customers. These systems are critical to the successful operation of many organizations and operate 24 hours a day, 7 days a week. Such a high degree of reliability, availability, and serviceability (RAS) is difficult to provide in a highly integrated SoC like the Power4 because traditional error diagnosis techniques become less workable. Therefore, achieving RAS targets for Power4-based systems requires a steady migration of IBM's mainframe S/390 RAS into Power4 system hardware and software.

Similar resources

Fault-tolerant design of the IBM pSeries 690 system using POWER4 processor technology

The POWER4-based p690 systems offer the highest performance of the IBM eServer pSeries line of computers. Within the general-purpose UNIX server market, they also offer the highest levels of concurrent error detection, fault isolation, recovery, and availability. High availability is achieved by minimizing component failure rates through improvements in the base technology, and through design t...

Policy-Based Autonomic Storage Allocation

The goal of autonomic storage allocation is to achieve allocation of storage resources, their performance monitoring, and hotspot elimination by specifying comparatively high-level goals, rather than by means of low-level manual steps. The process of automation should allow specification of policies as administrator specified constraints under which the resources are managed. This paper describ...

Reliability based budgeting with the case study of TV broadcast

Planning a budget helps to identify wasteful expenditures, adapt quickly to changes in the financial situation, and achieve financial goals. Reliability-based budgeting is of great importance for the broadcasting industry. In this study, several kinds of failure modes in TV broadcasting system have been det...

Enterprise Systems Architecture/370: An Architecture for Multiple Virtual Space Access and Authorization

The Enterprise Systems Architecture/370 provides a significant step in the IBM System/370 evolution by providing new capabilities for virtual addressing and program linkage across multiple address spaces. This paper reviews the evolution that led to this advance and illuminates the goals, such as eliminating growth constraints and improving security, integrity, reliability, and performance, th...

Explaining the Model of Social Responsible Organization from the Perspective of Academic Experts

Purpose: The purpose of this study was to design a model of social responsibility and explain the framework of social responsibility based on the ISO 26000 standard in order to achieve sustainable development from the perspective of academic experts. Methodology: This research is applied in terms of purpose and is descriptive-survey in terms of data collection. The statistical population in th...

Publication date: 2002